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Abstract 

■ We obtain the first term in the high signal-to-noise ratio (SNR) expansion 
, of the capacity of fading networks where the transmitters and receivers — while 
I fully cognizant of the fading law — have no access to the fading realization. 

■ This term is an integer multiple of log log SNR with the coefficient having a 

^ . simple combinatorial characterization. 
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1 Introduction 



^ I In this paper we consider a discrete-time vector fading channel, where the transmit- 

ted vector suffers from both multiplicative and additive noises. The multiplicative 
noise takes the form of a matrix-valued stationary and ergodic process that multi- 
. plies the transmitted vector, and the additive noise takes the form of independent 

00 ! and identically distributed (IID) isotropic Gaussian vectors. We only consider the 



case where neither the realization of the additive noise nor of the multiplicative 
noise is known to the transmitter and receiver; only their probability laws are given. 
^ The mathematical model that we address is thus very similar to the "non-coherent" 

Q . flat-fading multiple-antenna channel model. 

! There is, however, an important difference. In the multiple-antenna channel 

^ ' model we think of the components of the transmitted vector as being the signals 

transmitted by co-located antennas. Similarly, the components of the received vec- 
^ ' tors are viewed as the signals received at co-located antennas. Our model is more 

general. We can think of the different components of the input vector as being 
controlled by a single-user as in a single-user multiple-antenna communication sce- 
nario, but we can also think of each component as being controlled by different 
geographically separated users as, for example, in a multiple-user network where 
each of the users employs a single transmit antenna. We can also envision that the 
components of the transmitted vector are partitioned into disjoint groups where the 
different groups are controlled by geographically separated users. This corresponds 
to a network where the different geographically separated users may employ multiple 
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transmit antennas of various numbers. Finally, in our setup the different components 
of the input vector need not correspond to physically different transmit antennas. 
We can also envision a scenario where they correspond to transmissions taking place 
at different frequencies and/or times as in a network employing a slotted protocol. 
Analogous scenarios can be envisioned for the received vector. 

The various scenarios mentioned above differ not only in the allowed depen- 
dencies between the different components of the transmitted vector. It turns out 
that, at high signal-to-noise ratio (SNR), far more important is the structure of 
the multiplicative noise that they imply. For example, if a certain receive antenna 
and a certain transmit antenna operate at different time/frequency slots, then the 
corresponding component in the multiplicative noise matrix will be deterministi- 
cally zero. A similar situation occurs when a given transmitter is geographically 
very far apart from a given receiver as could, for example, be the case in a cellular 
system. For example, in Wyner's linear cellular model ^ j2] the transmitters are 
assumed to be uniformly spaced on a line, and each transmitter is received by only 
two base-stations: the base station to its left and the base station to its right. 

As we shall see, rather than the cooperation restrictions, it is these deterministic 
zeroes that will determine the high SNR asymptotic behavior of channel capacity. 
Very roughly, the main result of this paper is that, irrespective of the cooperation 
allowed, at high SNR the capacity of the channel C is given approximately by 

C ^ -log log SNR (1) 

where the non-negative integer k* can be computed combinatorially from the zeros 
of the multiplicative noise. 

The above result can be viewed as an extension of a result of |3] on multiple- 
antenna fading channels. In the multiple-antenna scenario where the components 
of the transmitted vector are geographically co-located and where the components 
of the received vector are also co-located, there are typically no deterministic zeroes 
in the fading noise. In this case it can be readily verified that our combinatorial 
expression for k* yields the value of 1, thus recovering the 1 Tog log SNR asymptotics 
of 0. 

The rest of the this paper is organized as follows. In the next section we describe 
the channel model and state the main result. In Section El we provide a proof, and 
in the final section. Section IH we summarize our results and discuss some possible 
extensions. 

2 Channel Model and Main Result 

The channel we consider is a discrete-time channel where the time-fc channel input 
Xfc G C""^ is an nx-dimensional complex vector, where /c G Z, is a discrete-time index 
taking value in the integers Z; nx is a positive integer; C denotes the complex field; 
and C""^ denotes the nx-dimensional complex Euclidean space. We refer to nx as 
the number of transmitters, and to the set 

T={l,...,nT} (2) 

as the set of transmitters. For every t G T we denote the t-th component of the 
time-/c input vector by Xk{t). This corresponds to the signal transmitted at time 
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k by Transmitter t. The time-k channel output G C"^ corresponding to the 
input Xfc is given by 

Yfc = MfcXfc + Z, (3) 
where is a positive integer that denotes the number of receive antennas and where 

7^ = {l,...,nR} (4) 

denotes the set of receivers. In the above, {Mk} is a matrix- valued stochastic pro- 
cess such that at every time instant k the random matrix M.^ is an x nx complex 
random matrix, and the random vectors {Z^} are independent and identically dis- 
tributed (IID), each taking value in C"^ according to an isotropic circularly sym- 
metric multivariate complex Gaussian law 

Zfc~Arc(0,l„J (5) 

where denotes the riR x identity matrix. (In general, W ~ Mcil-i, A) indicates 
that W — ^ is a zero- mean circularly symmetric complex Gaussian random vector of 
covariance matrix A.) We assume throughout that the processes {H^} and {Z^.} are 
independent and that their joint law does not depend on the input sequence {x^.}. 
Denoting by Hk{r,t) the row-r column-t entry of the matrix H^, and denoting by 
Zk{r) the r-th component of the time-fc additive noise vector Z^, we can re-write 
© as 

Yk{r) = J2 Mr, t)xk{t) +Zk{r), re 7^. (6) 

To account for the possibility that some of the components of the fading matrices 
might be deterministically zero we introduce the set Z 

Z cTZxT (7) 

where if {r,t) e Z then Hk{r,t) is deterministically zero at all times A; G Z: 

(r, t)eZ^ [Hkir, t) = 0, VA; G z) . (8) 

As for the other components, we shall assume a finite second moment 

E[|i7fc(r,t)|2] <oo, (r,^)G7^xT (9) 

and a finite differential entropy rate condition that we next describe. But first we 
introduce some notation. Given a collection of random variables indexed 
by a set A we denote, for any subset C ^, by W{B) the unordered collection 
{W{a)}a(z]3. With this notation and ^ we have that H^i^Z'^) is the collection of 
(= "^R ■ '^T ~ 1^1) random variables 

Hk{Z') = {Hk{r,t) : {r,t) ^ Z} (10) 

where we use Z'^ to denote the set complement of 2 in 7^ x T and we use | ■ | 
to denote set cardinality. The finite differentiable entropy rate condition that we 
require can be now stated as 

h{{H,iZ^)},^^)>-^. (11) 
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In the case where {M.k} is IID, this condition translates to the joint differential 
entropy of the (nR ■ — |^|) random variables {H^ir, t), (r, t) ^ Z} being finite. 
In the more general case, ()11|) can be written as 

\im -hi Hi{Z^),...,Hn{Z^)] > -oo (12) 

or even more explicitly as 

lim -/tf{i/fc(r,t)}, \<k<n, (r, t) ^ Z}) > -oo. (13) 

n— >oo 77. V / 

Notice that a stationary process {Hfc} simultaneously satisfies © and (|TT|l if, 
and only if, it simultaneously satisfies (jHI) and the two conditions 

/i(Hi(Z^)) > -oo and lim / (Hi, . . . , efc_i; H^) < oo. (14) 

We denote by Csu(^^) the capacity of this channel under full cooperation con- 
ditions when the input is subjected to the average power constraint E. That is, 

Csu(^)= lim -sup/(Xi,...,X„;Yi,...,YO (15) 
where the supremum is over all joint distributions on Xi, . . . , X„ satisfying 

ix:E[iix,n<f. 

fc=l 

This is thus the capacity when a single-user controls the input vector G C""^, 
and when a "super-receiver" has access to all the components of the received vector 
Yfc. Similarly, we define Csu,fb(^) as the single-user capacity but when there is a 
noiseless feedback link so that the time-fc transmitted signal X^ is allowed to depend 
not only on the message to be transmitted but also on all the past channel outputs. 
Clearly, Csu(^) < C'su,fb(^) because the feedback link can always be ignored. 

At the other extreme we define Cmac(^-) as the sum-rate capacity for this channel 
when it is viewed as a multiple-access channel (MAC) where the different compo- 
nents of the input vector are viewed as separate users who wish to communicate 
independent messages. Each user is assumed to be allowed a peak power of E. The 
assumption of a "super-receiver" continues to hold. (We shall later see that this 
assumption can be significantly relaxed.) We assume no feedback link. We thus 
have, 

Cmac(^) < Csu,FB(riT ■ E). (16) 

To state the paper's main theorem we need to introduce the notion of a "power 
chain". To define this concept we introduce the following notation. For any trans- 
mitter if: G T let 'R-t be the set of receivers that can "hear" it, z.e., 

7^t = {rG7^:(r,^)^Z}. (17) 

Analogously, for any receiver r G 7?., let % denote the set of transmitters that r 
"hears": 

% = {t^T ■■{r,t) iZ). (18) 
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Definition 1. We shall say that the K-tuple {ti, . . . jt^) G is a K-length power 
chain with respect to the set Z if 

Tit, ^ (19) 

and 

T^tA U ^^ = 2,...,/^:. (20) 

l<ri<u 

We can now state the paper's main result. 

Theorem 2. Consider a vector fading channel whose input takes value in C"'^ 
and whose output takes value in C"^. Let the set Z G TZ x T be given, where TZ and 
T are defined in (0} and (0) respectively. Assume that the stationary and ergodic 
matrix-valued fading process {Hfc} satisfies Q, and flll|) . Further assume that 
{Zfc} are IID according to (0), that the process {Z^} is independent of {M./^}, and that 
their joint law does not depend on the channel input sequence {x^}. Let Csv,fb{^) 
and CuAci^) defined as above. Then, 

^ |Csu,fb(^) - /€*loglog^| < oo (21) 
£-+00 1. J 

where k* = K*{nT, nn, Z) is the length of the longest power chain with respect to Z. 

If, additionally, {H^} has a Gaussian marginal, i.e., if the components of the ma- 
trix Mi (and hence, by stationarity, ofM.^ for any k) are jointly circularly symmetric 
and Gaussian, then 



lim 



I K* log log £ - Cmac(^)} < 00. (22) 



Moreover, in this Gaussian case, (j22|) is achievable with k* single-user scalar detec- 
tors. That is, there exist transmitters ti, . . . , t^* G receivers ri, . . . , r^* G TZ; and 
distributions for X under which the components of X are independent, under which 
the peak constraints \X{t)\ < S, t eT are satisfied almost surely, and such that 

n^|/€*loglog^-|^/(x(t,);r(r,))| <oo. (23) 

Note that since loglog(a^) — log log ^ converges to zero as ^ 00 with a > 
held fixed, it follows from ()l(ij) and from the theorem that 

lim {Csu,fb(^) - Cmac(^)} > 0. (24) 

>oo 

Consequently, we can loosely say that, at high SNR, the capacity of a Gaussian 
fading network is given by (^, where k* = K*{nT,n-[i, Z), irrespective of whether 
we impose individual peak power constraints or whether we impose combined av- 
erage power constraints, irrespective of whether we allow cooperation between the 
transmitters or not, and irrespective of whether feedback is available or not. 
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3 Proof of Theorem [21 



In this section we provide a proof of Theorem |21 We shall begin by showing that 
it suffices to prove the theorem in the case where the fading {H^} is memoryless, 
i.e., when {H^} are IID. We shall then separately prove the "converse" ()21j) and the 
"direct" part in the two corresponding subsections. 

Let then {M.^} be some stationary and ergodic fading process with memory 
satisfying Q & (HH), and let {Elfc} be an IID fading process of equal marginal so 
that the law of is the same as the law of Hi (which is the same, by stationary, 
as the law of for any A; G Z) . 

That it suffices to prove the converse in the memoryless case follows because, 
as shown by Moser jlj Chapter 8], the difference between the feedback-capacity of 
the channel with fading {Hfc} and the capacity of the channel with IID fading {H^} 
is bounded in the SNR. For the sake of completeness we repeat Moser's result in 
Appendix El 

As to the direct part, we note that the capacity of the channel with fading 
{Hfc} cannot be smaller than that of fading {Elfc}. Indeed, if Q is any distribution 
on C"''^ then the mutual information on the memoryless channel of fading {H^} is 
achievable on the channel of fading {Mk} by considering inputs Xi, . . . , X„ that are 
IID according to Q. Indeed, for such IID inputs 



1 1 " 

-/ (X^; Y^) = - ^ /(Xfc; Y^|xn 

k=l 

k=l 
1 " 

> - V/(Xfc;Yfc) 



n 

k=l 



= /(Xi;eiXi + Zi) 
= /(Xi;eiXi + Zi) 

where the first equality follows from the chain rule; the subsequent from the indepen- 
dence of Xi, . . . , X„; and the subsequent inequality because reducing observations 
cannot increase mutual information. Here we use X™ to denote X^, . . . , X^,. 

We shall thus proceed to prove the theorem assuming that the fading is mem- 
oryless. In this case we shall omit the time index so that our assumptions on the 
fading process can be now written as: 

E [\H{r, t)\^]< oo, (r, t) G 7^ X T (25) 

h{H{Z')) > -oo (26) 

(r, t) e Z ^ (^H{r, t) = 0, almost surely) . (27) 
We shall further assume that none of the rows of EI is deterministically zero, i.e., 

Wen 3t eT : {r,t) i Z (28) 
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or equivalently, 



r en. 



(29) 



This corresponds to the condition that every receiver "hears" at least one transmit- 
ter. Analogously, we shall assume that none of the columns of EI is deterministically 
zero, i.e., 

yteT 3r eTZ: {r,t) ^ Z (30) 

or equivalently 

7^i 7^ 0, ter. (31) 

This corresponds to the condition that every transmitter is heard by at least one 
receiver. The above assumptions can be made without loss of generality because a 
receiver that hears no signals (other than ambient additive noise) does not affect the 
longest power chain and can also be ignored at the detector. Similarly, a transmitter 
that cannot be heard by any receiver will never be an element of a power chain and 
there is also no point in having it transmit any signal. 

3.1 The Converse 

In this section we provide a proof of (|7T|) for IID fading {H^.} satisfying (j^ . ^I^. 
and ()27p. We begin by considering the "ordering permutation" cr(x) of a given 
nT-tuple X e C"'^. This is the permutation that orders the components of x in 
descending order of their magnitudes, resolving ties with preference to lower indices. 
Thus, given an riT-tuple x G C"'^ we set cr(X) to be the permutation r : u \—>- t„ on 
T that satisfies 

|a;(ri)| > |a:(r2)| > •■■ > |a;(r„J| (32) 
and that resolves ties in favor of lower indices so that 

\x{t^) \ = |x(r^+i)| ^ < r^+i. (33) 

The form in which ties are resolved does not play an important role in our analysis. 
It is made here explicit because it is important that x G C""^ determine the ordering 
permutation (t(x) uniquely. 

If X is a random vector taking value in C"'^, then its ordering permutation cr(X) 
is a random permutation. Since the number of permutation on T is wt', it follows 
that, irrespective of the distribution of X, the entropy of cr(X) is upper bounded by 

H{a{X)) < \ogm\ . (34) 

Given any channel input X we can thus expand the mutual information /(X; Y) 
between the channel terminals as: 

/(X;Y) = /(X,a(X);Y) 

= /(a(X);Y)+/(X;Y|a(X)) 

< H{a{X))+l{X-Y\aiX)) 

< lognx! + /(X; Y|a(X)) 

J2 /(X;Y|a(X) = r)Pr[a(X)=r]+lognT! (35) 

r:Pi(o-(X)=r]>0 
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The proof of the converse will now focus on the terms of the form 

j(X;Y|a(X)=r) 

where r is an arbitrary permutation satisfying 

Pr[cT(X) = r] > 0. 

Fix then such a permutation r and let 

^, = E[||X||V(X) = r]. (36) 

We will show that corresponding to the set Z and to the permutation r there is a 
power chain of length k = t) such that 

/(X; Y|a(X) = r) < k ■ log(l + log(l + S^)) + c (37) 
< /€*-log(l + log(l + f,)) + c (38) 

where the constant c depends only on the law of H and on the permutation r but 
not on the power Sr- 

Note that once we establish (jHH|) . the converse will follow from & and 
Jensen's inequality by the concavity of the double-logarithmic function. We thus 
proceed to construct the power chain and to then prove (jHTj) . 

To simplify the typesetting describing the construction of k{Z, t) we shall use 
[p] for T,^. Thus, conditional on a(X) = r we have that -^([1]) has the maximal 
magnitude among all the elements of X, and X([nT]) has the smallest magnitude. 

Let ji = 1. Assume that we have defined ji, . . . We then define j^+i as 

= min I J, < £ < nx : 7^[,] \ Q 7^[,•,] ^ 0| (39) 

where the minimum of an empty set should be understood as oo. We then set 

K = max {1 < ly < riT ■ ju < oo} (40) 

and define 

u-l 

U = [ju] Bu = ntA[jT^[t,h iy = l,...,K. (41) 

r,=l 

Thus, tu is the next strongest transmitter after t^-i that can be heard by some 
receiver that is uninfluenced by any of the stronger transmitters that are already 
in the chain. The set is the set of receivers that can hear t^, but not any of the 
stronger transmitters that are in the chain. Note that by (jHT|) we have TZtj^ 7^ 0. In 
fact, (ti, . . . , t^) is a power chain with respect to Z, so that 

K < K*. (42) 

(Recall that k* is the length of the longest power chain with respect to Z.) Also 
note that the sets {B^,} are disjoint and that by their union is 71, i.e., they 
form a partition of TZ: 

K 

n=[jBu Bur\Bu> = ^ whenever I < v ^ v' < (43) 

v=l 
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Finally, we define 

A = {bV],...,bV+i-l]} z/ =!,...,«:- 1 (44) 

and 

A = {[jK],...,[nT]}. (45) 

The key properties of the constructions of k(2, r), of the power chain (ti, . . . , t^), 
of the collection {B,^}, and of the collection {A^} 

dbTQ clS follows. The K-tuple 
(ti, . . . j^k) is a power chain, so that n < k*, (fl^ . The collections {B^} and {Au} 
are partitions of TZ and T respectively. And conditional on ct(X) = r the ran- 
dom variables X^A^) only influence ^(Uj^^j^i?^); they do not influence any receiver 
in 7^ \ Uj^^j^S^. That is, conditional on cr(X) = r and on the random variables 
X{Au U ■ ■ • U Ak), the random variables X{Ai U • ■ ■ U A^-i) are independent of the 
random variables Y{B,y). 

Using these properties, we next prove (jSZj)- The key will be the following lemma: 

Lemma 3. Let M. be a random x nx complex matrix whose components are all 
of finite second moment 

E[\H{r,t)\^] < oo, (r,t) G 7^ X r 

where 71 = {1, . . . , wr} and T = {1, . . . , wt}- Let the set Z C TZ x T be the set of 
pairs (r, t) such that H{r,t) is deterministically zero: 

H{r,t) = almost surely V(r, t) G Z. 

Assume that the joint differential entropy of the coordinates that are not in Z is 
finite 

h{E{Z^))> -oo. (46) 

Lett* G T be fixed. Assume that Transmitter t* influences all receivers in the sense 
that 

{r,t*)^Z, VrG7^. (47) 

Let l^be a random vector taking value in C""^ whose component of largest magnitude 
is almost surely t* : 

ni5^|X(t)| = |X(r)|, almost surely. (48) 

Assume the average power constraint 

J2^[\X{t)\']<S. 
teT 

Finally, let Z take value in C""^ according the multivariate Gaussian law jVciO, In^) 
and assume that H, Z, and X are independent. 

Then there exists some constant c, which depends on the law o/H but not on the 
law ofX. or on its power S, such that 

/(X; ex + Z) < log(l + log(l + £)) + c. (49) 
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Proof. Let Y 



HX + Z and let 
I? = {x G I 



: max \x{t) \ = \x{t*)\} 

l<t<nT 



(50) 



so that ()48|1 can be rewritten as Pr[X G P] = 1. 

The proof of the lemma is very similar to the proof of [3, Theorem 4.2]. It too 
is based on the bound Eq. (333)] : 

/(X;Y)<log7r"^-logr(nR) 

+ Ex KE [log l|Yf |X = x] - /i(Y|X = x)] 
+ a(l + logE[||Y]|VE[log||Y||2]) 
+ log r (a) — a log a, a > 0. (51) 
From this inequality it follows that for inputs X satisfying (pSjl 

/(X;Y)<log7r"«-logr(nR) 

+ sup{nRE[log]|Y|nX = x] -/i(Y|X = x)} 

+ a(l + logE[||Yf] -E[log||Yf]) 

+ logr(a) -aloga, a > 0, Pr[X G P] = 1. (52) 

We now proceed to analyze the various terms in the above. We begin with 
showing that the supremum, which does not depend on is finite 



sup {nRE[log II Y] 



|X = x] -/i(Y|X = x)} < oo. 



(53) 



To this end we use Jensen's inequality to obtain: 

nRE[log||Y|nx = x] <nRlogE[||Y|nx = x] , xgC 

= nRlog (E[||eXf |X = x] + E[||Zf ]) , x G C"^ 

< riR log (E [Pill] -11x112 + tir), xgC"- 

<nR log (E [Pill] .nT-\xif)\^ + nn), x G I? (54) 

where the second inequality follows from the Cauchy — Schwarz Inequality with 
Pill = |if(r, t)p denoting the squared Frobenius norm, and where the last 
inequality follows by restricting x to be in the set V where ||x|p < nT\x{t*)\'^. 

As to the differential entropy term in we obtain two separate bounds. The 
first is useful when ||xp is very small and is otherwise quite crude 



h(Y\X 



x) = /i(ex + Z) 
> h{Z) 

= nRlog(7re). 



xG C 



(55) 



The second is 

/i(Y|X = x) > /i(ex 



{H{r,t)}ren, teT\{t*}^ 
h(^{H{r',t*)-xit*)y,^n 
n^\og\xit*)\' + h(^{Hir',t*)}r>en 
nnlog\x{t*)\' + h(^{H{r\t*)}r>^T^ 



{H{r,t)}ren, teT\{t*} 

{H{r,t)}reTZ, ter\{t*} 

{H{r,t)}t(^T\{t''}, (r-Mz 
\x{t*)\ > 0. 



(56) 
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Here the first inequality follows because conditioning cannot increase differential 
entropy; the subsequent equality by expressing Hx as 



Hx 



(EteTH{l,t)x{t)\ 

I Hil, t*)x{t*) \ I Eter\{t*} Hih t)x(t) \ 



+ 



\H{nn,t*)x{t*)J 



\EteT\{t'} H{nR,t)x{t) J 



by noting that conditional on {H{r,t)}r(=n,teT\{t*} the second term on the right 
is deterministic, and by noting that the addition of a deterministic vector does 
not affect a vector's differential entropy; the next equality from the behavior of 
differential entropy under scaling; and the final equality because it is pointless to 
condition on deterministic random variables. Note that guarantees that the 
RHS of dnni) is finite. 

Inequalities ()55|) and combine to prove ()53|) . The analysis of the other 
terms in ()52j) and the choice of a = a{S) in is identical to the analysis in [3 
Appendix II]: 



logE[||Yf] =log(E[||eXf]+E[||Zf]) 



< log (E [lief] E[||Xf] 

<log(E[||e|||]f + nR) 



E[iizir]) 



(57) 



E[log||Yf] = E[log||eX + Zf] 
>E[log||Zf] 



(58) 
(59) 



a*= (l + logE[||Yf] -E[log||Yf])-' 
where a* | with the SNR. See jSl Appendix II] for the details. 



(60) 
□ 



Note 4. The Gaussianity of the noise in the above lemma is not crucial. As in 
Appendix II] the result continues to hold whenever E[||Zf ] < oo zs of finite second 
moment and of finite differential entropy. 

With the aid of this lemma we can now prove (jSZj)- We shall upper bound 
/(X; Y|(t(X) = t) in k phases. In the first phase we shall upper bound this mutual 
information by a double-logarithmic term, a constant, and another mutual infor- 
mation term. This latter mutual information term will be upper bounded in the 
second phase by a double- logarithmic term, a constant, and yet another mutual in- 
formation term, which is then upper bounded in the third phase. In the final phase, 
Phase K, we upper bound the mutual information by a double-logarithmic term and 
a constant only, thus terminating the calculation. Since each phase contributes a 
double-logarithmic term, the k phases contribute together a K-loglog^,-, as required. 
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The details now follow. In Phase 1 we expand mutual information using the 
chain rule 

/(X; Y|a(X) =r)= /(X(T); F(7^) |(t(X) = r) 

= /(X(T);m),m)k(X) = r) 
= /(X(r);m)k(X)=r) 

+ l{X{Ty,Y{B'i)\Y{B^),a{X) = r). (61) 

The first term on the right of the above is easily treated using the lemma, because 
conditional on cr(X) = r, the component X(ti) (= X([l])) is of largest magnitude, 
and it is heard by all the receivers in Bi. Consequently, we have by Lemma 121 

l{X{T);Y{B,)\a{X) = r) < log(l + log(l + ^,)) + Ci (62) 

where the constant Ci is as in Lemma El independent of the SNR. 

As for the second term on the RHS of (|?)T|l we use the chain rule once again to 
obtain 

/(X(r);m)|m),a(X) = r) 
</(X(T),m);m)|a(X)=r) 

= l{X{Ty,Y{Bt)\a{X) = T)+l{Y{B^y,Y{B'i)\X{T),a{X) = r) 
= /(X(^^);r(i3J)|a(X) = r) +/(r(i3i);r(SJ)|X(T),a(X) = r) 

< l{XiAiy, YiB1)\aiX.) = t)+I ({Hir, t)} ter ; {if(r, t)} ter ) (63) 

Here the last inequality follows by the data processing inequality, and the preceding 
equality follows because Y{Bl) is conditionally independent of X{Ai) given Xi^Al). 

Thus, we have by (pT|) and (jU^ that the original mutual information term is 
upper bounded by a double-logarithmic term, a constant term, and another mutual 
information term: 

/(X;Y|a(X) =r) < log(l + log(l + ^.)) 

+ ci + / ({H{r, t)} ter ; {H{r, t)} t^r \ 

+ l{X{AiyY{Bl)\a{X) = T). (64) 

The mutual information term 

l{X{A\yY{Bl)\a{X) = r) 

on the RHS of the above is now upper bounded in Phase 2. Notice that this term 
corresponds to a "smaller" fading channel where the inputs Ai are immaterial, as 
are the outputs Bi. In Phase 2 we thus upper bound this term as follows: 

l{X{A\y Y{B\)\a{iq = t)= l{X{A\y Y{Bl n S2), Y{B\ n Bl)\a{X) = r) 

= l{X{A\yY{BlnB2)\aOq = t) 

+ l{X{AiyY{BlnBl)\Y{BlnB2),a{iq = T). (65) 
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The first term can be bounded using the lemma because X{t2) is the component of 
X{A1) of largest magnitude, and it is heard by all receivers in Bin 82'- 

l{X{Al);YiBl n B2)\(r{X) = r) < log(l + log(l + Sr)) + C2, 

for some constant C2. 

The second term in can be expanded in analogy to (jUHj) to yield 

l{XiA'i);YiBl n B'2)\YiBl n i32),a(X) = r) 
< I{X{AI), Y{Bl n YiBl n |a(X) = r) 

= l{X{Aiy,Y{Bl n i3^)|a(X) = r) n B2);Y{BI n i3^)|X(^5),(T(X) = r) 

= /(X(^^n^^2);mni3^)|a(X) = r)+/f{i/(r,t)} ;Wr,t)} t^^c ). 

The mutual information term 

liXiAl n A'i);YiBl n S^)|a(X) = r) 

is now upper bounded in Phase 3. This process is continued until the final phase, 
Phase K, when the term 

I{X{AI n ■ ■ ■ n AUy,Y{Bl n • ■ ■ n b:_,)) = /(x(A); Y{B.)) 

is upper bounded using the lemma by a double-logarithmic term and a constant 
without an additional mutual information term. Indeed, the component X{tf^) is of 
largest magnitude among the terms in X{Ak) and it influences all the receivers in 
B.. 

It is thus seen that performing a total of k phases yields the bound (|H7jl and 
hence, by (j^ . also (jHHj) . The converse now follows from (jHHj) and using Jensen's 
inequality because the double-logarithmic function is concave and because, in view 

of dSHD, 

PrKX)=r].^. = E[||Xf]. (66) 

T:Pi(cr(X)=r]>0 

3.2 The Direct Part 

To prove the direct part we shall demonstrate that, under the Gaussian marginals 
assumption, if (ti, . . . , t^) £ '^'^ is any power chain with respect to Z then we can find 
a distribution on X under which its components are independent (thus guaranteeing 
achievability under multiple-access conditions) and such that 

|K-loglog^-/(X;Y)| < cx). (67) 

The proof that this mutual information is achievable with scalar single- user detectors 
will be described separately. 

Fix then some such power chain (ti, . . . , is) £ T^. Consider now a distribution 
for X under which the components of X are independent with marginals that can 
be described as follows. If some t G T is not in {ti, . . . ,ti^}, we set X(t) to be 
deterministically zero 

t^{ti,...,t^}^{X{t) = 0, a.s.). (68) 
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As to the other components of X, we choose them to be circularly symmetric with 
squared magnitudes whose logarithms are uniformly distributed on an interval that 
will be described later: 

log \X{U)\^ ~ Uniform (^logx^i^^^, logx^^^^^) , u = 1,...,k. (69) 

Here 

< 2;Ln,i. < 2;Lx,^ < logf , U = 1,...,K (70) 
will be specified later. (See (jU^ & (jHH).) Note that with this choice of the marginals, 

/z(log|X(t,)|2) =loglog^, z/ =!,...,«:. (71) 

Since (ti, . . . ,ti^) is a power chain, it follows that for every 1 < u < k we can 
find a receiver such that 

rj=l 

Thus, Receiver can hear Transmitter 

{r,,t,)iZ (73) 

but it is uninfluenced by the transmitters ti, . . . , t^-i 

(r„gGZ, r7 = l,...,z/-l. (74) 

It may be influenced by transmitters ti^+i, • • . but those, as we shall see, will be 
chosen to have powers that are much smaller than the power assigned to Transmit- 
ter ty. 

The mutual information /(X; Y) can be now lower bounded as follows: 
I{y.-Y) = I {{X{K)}:^,-Y) 

K 

= ^/(X(t.);Y|{X(g}:;=,+i) 

K 

> ^/(X(t.);r(r,)| {X(t,)};^,^0 • (75) 

Here the first equality follows by (jHH|l : the second by the chain rule; and the subse- 
quent inequality by restricting the set of observables in each of the terms. 
We shall next show that with a judicious choice of the constants 

■{•^min,!^}; {"^max,!/}) ^ 1, . . . , /€ 

in we can guarantee that each of the k, terms in ()75|1 grows double-logarithmically 
in the SNR. 

The term that is easiest to deal with is the term I{X{tf^)] Y{rf^)). It corresponds 
to the mutual information across the terminals of a Ricean fading channel with 
additive Gaussian noise: 

Y{r,)=H{r^,QX{Q + Z{r,). 
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Indeed, by our choice of r^, non of the transmitters ti, . . .ts-i influences it (jZlj), 
and the other transmitters were chosen deterministically zero ()68|). This mutual 
information term can therefore be handled using the results from ^3, on the Ricean 
fading channel. 

The other terms, however, are more complicated. Consider the expression 

/(X(t,);F(r,) = x(Wi),...,X(y =x(y) (76) 

for some 1 < u < k. By fl(i8|l and (f7^ it follows that we can express Y{r^) as: 

Y{r,) = H{r,, U)X{U) + W{r,) (77) 

where 

K 

W{r,) = H{r„Qx{Q + Z{r,), 1 < u < k. (78) 

ri=u+l 

Since the components of EI are jointly Gaussian, it follows that under the condition- 
ing in (fTUI) . the pair {H{r^, t^), W{r^)) are jointly Gaussian. They are not, however, 
independent because the components of EI may be dependent. We thus need to 
analyze the mutual information across the Ricean fading channel when the additive 
noise and the multiplicative noise are jointly Gaussian and dependent. The following 
lemma does just that. 

Lemma 5. Let the pair of complex random variables {H, W) be jointly Gaussian, 
and assume that the pair is independent of the complex random variable X . Assume 
that X has a finite second moment and that it is of finite differential entropy. Then, 



I{X; HX+W) > /i(X)-E [log |Xp] +E [log \H\^] -E 



\og(ne{aH + T^Y^ 



. (79) 



where a^,,ajj > are the variances of W and H respectively. Consequently, if the 
magnitude of X is almost surely larger than the positive constant Xrain > 0, then 



I{X; HX + W)> h{X) - E [log \X\^] + E [log \H\^] 



log [Tie [an 



\X\ > 



a.s.. (80) 



//, additionally, X is circularly symmetric, then 
I{X; HX + W)>h (log + log vr + E [log \H\'^] 



log vre (Th + 



aw 



\X\ > Xmin, circ. sym.. (81) 



Proof. First note that the assumptions that X has a finite second moment and 
finite differential entropy guarantee that the logarithm of its magnitude is of finite 
expectation O Lemma 7.7] so that the lemma's claim is meaningful. 
The proof proceeds by expressing I{X; HX + W) as 



I{X; HX + W) = h{HX + W)- h{HX + W\X) 



(82) 
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and by then bounding the terms on the RHS. We begin with the first: 



h{HX + W)> h{HX + W\H) 
> h{HX\H) 
= h{X) + E[\og\H\^] 



^3) 



where the first inequahty follows because conditioning cannot increase differential 
entropy; the second because conditional on H the random variables X and W are 
independent; and the subsequent equality from the behavior of differential entropy 
of complex random variables under deterministic scaling. 

As to the other term in ()82j). we note that conditional on X = x, the random 
variable HX + W is Gaussian. Hence, 



h{HX + W\X) = E [log +h(^H+^ 
= E[log|X|2]+E 

< E[log|X|2] +E 



X 



w 

log Tce ■ Var ( + ^ 



X 



log vre (Jh + 



\X\ 



where aj^ and the respective variances of H and W. 

Combining (j83|l and ()84|1 with (j82|l yields (|79|l . which combines with the mono- 
tonicity of the logarithm function to imply (jED))- Finally, to obtain (jHTjl we note that 
if X is circularly symmetric then 



h{X) - E[\og\X\^] = /i(log|X|2) +log7r 
which follows, for example, from jSl Eqs. (320) & (316)]. 



(85) 
□ 



To apply the lemma to the analysis of (f76|l - (f78|l we need an estimate on the 
variance of W{r,^). But such an estimate can be readily found using the Cauchy — 
Schwarz inequality. Under the conditioning in ()76j) . we have from ()78j) 



Var(l^(r,)) < E[\Wir,)\^] 

= E[\Z{r^)\']+E 



r]=u+l ri=v+l 

2 



< 1 + E[||H|||] ■ (k- z/) max 



p<rj<K 



max,r;' 



^6) 



It thus follows from Lemma El and from (|HF)|l that the mutual information in (|7H|l 
will satisfy 



lim {loglog^ - I{X{U)- Y{r,) \ X(t,+i) = x(t,+i), . . . ,X(t,) = x(t«)) | < 



OO 
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uniformly over all {x(t^), = u + 1, . . . , k} satisfying 

■^min,77 ^ — •^max,riy ^7 -|- 1, . . . , 



whenever both 



lim — - — -75- — ; — ^^^^^ = 00 (89) 

£-00 1 + E[||e|||,] -(k-u) max^<^<^xLx,„ 

(so that the last term on the RHS of (|HT|) tends to zero) and 

lim < log log £^ — log log ^'^'^''^ > < 00 (90) 

(so that by (ffT|) the first term on the RHS of (jHlll has the right asymptotic growth.) 
Since fl89p and (jHIH) guarantee uniform convergence in (jHTj) it follows that they also 
guarantee that 

in^|loglog^-/(X(t,);y(r,)|X(Wi)...,X(y)| <oo, u = 1,...,k (91) 
£'—+00 L J 

and hence, by (f73j). they also guarantee that 

IhK|K-loglog^-/(X;Y)| < 00 (92) 

£—*oo L J 

as we had set out to prove. 

To conclude this part of the proof it is thus only required to find choices for 
{xmin,r],Xma.x,ri}'^=i that wiU guarantee that both (jHEI) and (jHOI) hold. An example of 
such a choice is: 



X 



max, 77 



£'/\ r] = l,...,K (93) 



X 



mm,r) 



^i/('?+i)log^^ r/ = !,...,«:. (94) 



Having established the achievabihty of (|22|) we now set out to prove that this 
can also be achieved using scalar single- user detectors. We do so by showing that, 
with our choice of X, ()9H1 implies that 

liS^ |loglog^-/(X(t,);F(r,))| < 00, u=1,...,k. (95) 

Since l(^X{ty)]Y{r^)) is achievable with a single-user detector, this will conclude 
the proof. To prove that implies we will show that 



lim {I{X{U); Y{r,) \ X(t,+i), . . . , X{Q) - l{X{U); Y{r,)) } = 0, u 



(96) 



To this end we upper bound this difference as: 

/(X(t,);F(r.) I {X(t,)},<,<«) -/(X(t,);F(r,)) 

= I{X%)- {X(t,)},<,<., r(r,)) - I{X%)- Y{r,)) 
= j(X(t,);{X(g},<,<.|r(r,)) 
</(X(t,),F(r,);{X(g},<,<.) 

</(X(t,),r(r,),{iJ(r„t^)},<^<«; {X(t^)},y<^<Kj 

= /({X(t,)},<^<,;F(r,) I {i7(r„t,)},<,<«,X(t,)). (97) 
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Here the first equality follows from the independence of X{t^) and {X(trf)}u<ri<K,', the 
second equality from the chain rule; the subsequent inequality from the chain rule 
and the nonnegativity of mutual information; the subsequent inequality by adding 
observations; and the last equality because {X(t^)}i,<^<K is independent of the pair 
(X(tj,), {H{r^,tj^)},y<^ri<K)- Conditional on 

the distribution of H{ry,ty) is Gaussian with some variance > 0, which does not 
depend on the realization of {H{ry,tj^)}y^ri<K- (Note that by (pUj) and (fTSj) is 
strictly larger than zero.) Consequently, if we additionally condition on X{ty) = 
x{ti,) we obtain that the mutual information between Y{r^) and {X(t^)}y<^<K is 
the same as the mutual information between {X(t^)}jy<r;<K and Y where 

K 

Y= J2 h{r,,t,)X{t,)+W (98) 

rj=u+l 

and where 

is independent of {X(t^)},^<^<K. Now 



J2 H{r,,t^)X{tr;, 



rj=u+l 



J2 \H{r^,t,)\'E[\X{Q\'] (99) 

ri=u+l 

because the components of X are independent and circularly symmetric (and hence 
of zero mean). By the data processing inequality and the Gaussian channel capacity 
formula we now obtain: 



l{{X{tn)}^<n<^;Y{r,,) \ {H{r^,tn)}„<rj<K, X{t^)) 
< E 



, , , , E;=.+iig(r.,gpE[|x(t,) 



< E 



< lot 



log 1 + 



^ E:=.+i|g(r.,gpE[|x(t 



1 4- f:2 . ^2 

E[|i7(r.,gp]E[|X(gp] 



1 -L f:2 . ^2 



< log I 1 
^ 



l + e^ -x 



(100) 



where the second inequality follows because |X(t,^)| > x^iri,v with probability one; 
the third inequality follows by Jensen's inequality; and the final limiting behavior 
follows because we have chosen {xmin,,,, a;max,??} so that (jH^ hold. 
The upper bound (fUTjl and ()100|1 now combine to prove (jHEI)- 
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4 Discussion and Summary 



In this paper we considered non-coherent fading networks with additive and mul- 
tiphcative noises. We have shown that, at high SNR, the capacity of the network 
grows hke an integer multiple of log log SNR. This integer multiple is determined 
by the location of the deterministic zeroes of the fading matrix. Loosely speaking, 
this integer can be viewed as the effective number of parallel channels that can be 
supported by the network, i.e., as the maximal number of point-to-point single-user 
scalar channels that can be supported by the network in a manner that will allow, 
with proper power allocation, negligible cross-interference. 

It is felt that this integer is an important parameter of the network, but that far 
more parameters are needed to obtain more precise approximations of the system's 
throughput. 

It has been pointed out to me by Shlomo Shamai that in some broadcast scenarios 
the fading levels experienced by the different users may be highly correlated so that 
the assumption that the non-zero components of the fading matrix are of finite joint 
differential entropy may be violated. Such scenarios can be nevertheless sometimes 
addressed using our results by noting that in broadcast scenarios the achievable 
rates are determined by the marginals of the network law Thus, in some such 
scenarios one can replace the fading matrix with a fading matrix whose rows are 
independent, but such that each row is of the same law as that in the original matrix. 



A On Memory and Feedback at High SNR 

In this appendix we show that if the stationary and ergodic fading process {M.k} 
satisfies (jHl) and (fT^ then the difference between the channel's capacity and the 
capacity of the memoryless channel of IID fading {H^}, where the law of Mfe is 
identical to the law of Hi, is bounded in the SNR. 

This proof is taken almost verbatim from |i] and is included here only for the 
sake of completeness. We denote the message to be transmitted by M, and we 
assume that the time-k transmitted input is now a function of M and of the 
previous outputs Y^"^. The proof, as in for example |5i Section 8.12], is based on 
Fano's inequality and on an upper bound on ■ /(M; Y"). 



-I{M;Y^) 
1 " 

n ^-^ ' 

k=l 

= ^E(^(^'Yr'Y,)-/(Yr;Y, 

k=l 
n 

k=l 
n 

<-^/(M,Yf-\et-i;Y,) 



(101) 
(102) 
(103) 
(104) 



k=l 
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1 

= - V/(M,Yt-\e^\X,;Yfc) (105) 
fc=i 

= - ^ (/(Ht-\Xfc;Y,) +/(M,Y^i;Yfc|et-\X,)) (106) 

fc=i 
1 " 

= -Vj(H^\Xfc;Yfe) (107) 
fc=i 
1 " 

= - J] (/(Xfc;Yfc) +/(Ht-^Y,|Xfc)). (108) 
fe=i 

Here the first two equalities follow from the chain rule; the subsequent inequality 
from the non-negativity of mutual information; the following inequality from adding 
random matrices; the subsequent equality follows since X^ is a deterministic function 
of M and Y^~^; then we have used the chain rule again; ()107j) follows since 

J(M, Yt'; Yfe I m1~\ X,) = 0; (109) 

and finally we have used the chain rule once more. 

The term /(X^; Y^) does not depend on the memory in the fading process and 
is thus identical for {H^} and for {H^}. As for the other term, we upper bound it 
as follows: 



/(M^^;Yfc|X,) < J(et-^Yfe,H,|X,) (110) 

= l{m'l-';Mk I Xfe) +/(e^^; Yfc I Xfc,H,,) (111) 

= /(et-^;efc|Xfc) (112) 

</(et-\Xfe;H,) (113) 

= /(efc;et-i) +/(efc;Xfc|e^-^) (ii4) 

= l{mk;Ml )■ (115) 



The feedback capacity of the channel with fading {Hfc} can thus exceed the capac- 
ity of the memoryless fading channel with equal-marginal fading {H^} by at most 
/(Hfc; IH^"^), which does not depend on the SNR and which, by assumption (|T^ . is 
finite. 
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